Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 22483955 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.8 GiB |
| Average record size in memory | 88.0 B |
Variable types
| Numeric | 11 |
|---|---|
Alerts
| Alert | Type |
|---|---|
| LongitudAcc is highly correlated with Fuel Rate and 2 other fields | High correlation |
| EngineSpeed is highly correlated with EngineAirInletPressure and 2 other fields | High correlation |
| Fuel Rate is highly correlated with Engine Load and 2 other fields | High correlation |
| Engine Load is highly correlated with Boost Pressure and 2 other fields | High correlation |
| Boost Pressure is highly correlated with Engine Load and 2 other fields | High correlation |
| EngineAirInletPressure is highly correlated with EngineSpeed and 3 other fields | High correlation |
| AcceleratorPedalPos is highly correlated with EngineSpeed and 4 other fields | High correlation |
| VehicleSpeed is highly correlated with EngineSpeed | High correlation |
| BrakePedalPos is highly correlated with AcceleratorPedalPos | High correlation |
| Fuel Rate is highly skewed (γ₁ = 50.74352758) | Skewed |
| Timestamp has unique values | Unique |
| LongitudAcc has 5253570 (23.4%) zeros | Zeros |
| EngineSpeed has 401981 (1.8%) zeros | Zeros |
| Fuel Rate has 5140033 (22.9%) zeros | Zeros |
| Engine Load has 5163040 (23.0%) zeros | Zeros |
| Boost Pressure has 1474517 (6.6%) zeros | Zeros |
| AcceleratorPedalPos has 8840773 (39.3%) zeros | Zeros |
| VehicleSpeed has 3187013 (14.2%) zeros | Zeros |
| BrakePedalPos has 18270026 (81.3%) zeros | Zeros |
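The percentages in the zero-count alerts follow from dividing each count by the number of observations. The sketch below reproduces three of them; `fmt_percent` is a hypothetical helper written for this illustration (it imitates the report's one-decimal style, including the `< 0.1%` form) and is not the library's own formatting function.

```python
def fmt_percent(count: int, total: int) -> str:
    """Format count/total as a one-decimal percentage, report style (assumed)."""
    share = 100.0 * count / total
    text = f"{share:.1f}%"
    # A non-zero share that rounds to 0.0% is shown as "< 0.1%".
    if text == "0.0%" and count > 0:
        return "< 0.1%"
    # A share just below 100% that rounds up is shown as "> 99.9%".
    if text == "100.0%" and count < total:
        return "> 99.9%"
    return text


N_OBSERVATIONS = 22_483_955  # "Number of observations" in the dataset statistics

# Zero counts copied from the alerts.
zero_counts = {
    "LongitudAcc": 5_253_570,
    "EngineSpeed": 401_981,
    "BrakePedalPos": 18_270_026,
}

zero_shares = {name: fmt_percent(count, N_OBSERVATIONS)
               for name, count in zero_counts.items()}
print(zero_shares)
```

The same helper can be used to recompute any frequency cell in the value tables from its count.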
Reproduction
| Analysis started | 2022-11-23 14:53:52.714253 |
|---|---|
| Analysis finished | 2022-11-23 15:21:46.290000 |
| Duration | 27 minutes and 53.58 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
Timestamp
Real number (ℝ≥0)
| Distinct | 22483955 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.874165907 × 10¹⁰ |
| Minimum | 1.98526209 × 10¹⁰ |
| Maximum | 1.117845807 × 10¹¹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 1.98526209 × 10¹⁰ |
|---|---|
| 5-th percentile | 2.637722905 × 10¹⁰ |
| Q1 | 4.899890384 × 10¹⁰ |
| median | 7.091272126 × 10¹⁰ |
| Q3 | 9.187471026 × 10¹⁰ |
| 95-th percentile | 1.081962579 × 10¹¹ |
| Maximum | 1.117845807 × 10¹¹ |
| Range | 9.193195981 × 10¹⁰ |
| Interquartile range (IQR) | 4.287580643 × 10¹⁰ |
Descriptive statistics
| Standard deviation | 2.663656995 × 10¹⁰ |
|---|---|
| Coefficient of variation (CV) | 0.3874880285 |
| Kurtosis | -1.157031879 |
| Mean | 6.874165907 × 10¹⁰ |
| Median Absolute Deviation (MAD) | 2.14590836 × 10¹⁰ |
| Skewness | -0.1623286515 |
| Sum | 1.545584369 × 10¹⁸ |
| Variance | 7.095068586 × 10²⁰ |
| Monotonicity | Strictly increasing |
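A "Strictly increasing" result means every value is larger than the one before it, which is consistent with the column having 100% distinct values. The following is a minimal sketch of such a check, not the profiler's implementation; the function name and the labels other than "Strictly increasing" and "Not monotonic" are assumptions made for illustration.

```python
def monotonicity(values):
    """Classify a sequence by comparing each value with its successor."""
    pairs = list(zip(values, values[1:]))
    if all(a < b for a, b in pairs):
        return "Strictly increasing"
    if all(a > b for a, b in pairs):
        return "Strictly decreasing"
    if all(a <= b for a, b in pairs):
        return "Increasing"
    if all(a >= b for a, b in pairs):
        return "Decreasing"
    return "Not monotonic"


# The four smallest Timestamp values listed in this report.
timestamps = [1.98526209e10, 1.985262163e10, 1.98526228e10, 1.985262399e10]
label = monotonicity(timestamps)
print(label)
```

Note that a sequence with fewer than two elements has no pairs to compare, so this sketch reports it as "Strictly increasing".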
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 1.98526209 × 10¹⁰ | 1 | < 0.1% |
| 8.467566757 × 10¹⁰ | 1 | < 0.1% |
| 8.467567574 × 10¹⁰ | 1 | < 0.1% |
| 8.467567465 × 10¹⁰ | 1 | < 0.1% |
| 8.467567378 × 10¹⁰ | 1 | < 0.1% |
| 8.467567273 × 10¹⁰ | 1 | < 0.1% |
| 8.467567171 × 10¹⁰ | 1 | < 0.1% |
| 8.467567063 × 10¹⁰ | 1 | < 0.1% |
| 8.467566978 × 10¹⁰ | 1 | < 0.1% |
| 8.46756687 × 10¹⁰ | 1 | < 0.1% |
| Other values (22483945) | 22483945 | > 99.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1.98526209 × 10¹⁰ | 1 | < 0.1% |
| 1.985262163 × 10¹⁰ | 1 | < 0.1% |
| 1.98526228 × 10¹⁰ | 1 | < 0.1% |
| 1.985262399 × 10¹⁰ | 1 | < 0.1% |
| 1.985262466 × 10¹⁰ | 1 | < 0.1% |
| 1.985262586 × 10¹⁰ | 1 | < 0.1% |
| 1.985262699 × 10¹⁰ | 1 | < 0.1% |
| 1.985262866 × 10¹⁰ | 1 | < 0.1% |
| 1.985262986 × 10¹⁰ | 1 | < 0.1% |
| 1.985263061 × 10¹⁰ | 1 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1.117845807 × 10¹¹ | 1 | < 0.1% |
| 1.117845798 × 10¹¹ | 1 | < 0.1% |
| 1.117845787 × 10¹¹ | 1 | < 0.1% |
| 1.117845776 × 10¹¹ | 1 | < 0.1% |
| 1.117845768 × 10¹¹ | 1 | < 0.1% |
| 1.117845757 × 10¹¹ | 1 | < 0.1% |
| 1.117845746 × 10¹¹ | 1 | < 0.1% |
| 1.117845738 × 10¹¹ | 1 | < 0.1% |
| 1.117845727 × 10¹¹ | 1 | < 0.1% |
| 1.117845716 × 10¹¹ | 1 | < 0.1% |
WetTankAirPressure
Real number (ℝ≥0)
| Distinct | 205 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.97261414 |
| Minimum | 0 |
| Maximum | 14.0658 |
| Zeros | 62805 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 9.7909 |
| Q1 | 10.4804 |
| median | 11.10095 |
| Q3 | 11.65255 |
| 95-th percentile | 12.20415 |
| Maximum | 14.0658 |
| Range | 14.0658 |
| Interquartile range (IQR) | 1.17215 |
Descriptive statistics
| Standard deviation | 1.096493401 |
|---|---|
| Coefficient of variation (CV) | 0.09993000637 |
| Kurtosis | 37.85479854 |
| Mean | 10.97261414 |
| Median Absolute Deviation (MAD) | 0.5516 |
| Skewness | -4.468134083 |
| Sum | 246707762.6 |
| Variance | 1.202297779 |
| Monotonicity | Not monotonic |
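Several of the statistics listed for WetTankAirPressure are derived from others: CV is the standard deviation divided by the mean, IQR is Q3 minus Q1, the variance is the squared standard deviation, and the sum is the mean times the number of observations. As a rough consistency check (a sketch using the rounded figures printed in this report, so comparisons use a tolerance), the relationships can be verified as follows.

```python
import math

# Figures copied from the WetTankAirPressure tables in this report.
n_observations = 22_483_955
mean = 10.97261414
std = 1.096493401
q1, q3 = 10.4804, 11.65255
minimum, maximum = 0.0, 14.0658

reported = {
    "cv": 0.09993000637,
    "iqr": 1.17215,
    "range": 14.0658,
    "variance": 1.202297779,
    "sum": 246707762.6,
}

derived = {
    "cv": std / mean,              # coefficient of variation
    "iqr": q3 - q1,                # interquartile range
    "range": maximum - minimum,
    "variance": std ** 2,
    "sum": mean * n_observations,
}

# Compare with a relative tolerance because the inputs are rounded.
checks = {name: math.isclose(derived[name], reported[name], rel_tol=1e-6)
          for name in reported}
print(checks)
```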
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 10.8941 | 783203 | 3.5% |
| 11.032 | 776894 | 3.5% |
| 10.82515 | 763826 | 3.4% |
| 11.10095 | 758912 | 3.4% |
| 11.1699 | 751705 | 3.3% |
| 10.7562 | 733935 | 3.3% |
| 11.3078 | 724007 | 3.2% |
| 11.4457 | 714184 | 3.2% |
| 11.51465 | 707858 | 3.1% |
| 11.37675 | 707232 | 3.1% |
| Other values (195) | 15062199 | 67.0% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 62805 | 0.3% |
| 0.06895 | 1185 | < 0.1% |
| 0.1379 | 613 | < 0.1% |
| 0.20685 | 1437 | < 0.1% |
| 0.2758 | 318 | < 0.1% |
| 0.34475 | 407 | < 0.1% |
| 0.4137 | 493 | < 0.1% |
| 0.48265 | 416 | < 0.1% |
| 0.5516 | 376 | < 0.1% |
| 0.62055 | 497 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 14.0658 | 2 | < 0.1% |
| 13.99685 | 1 | < 0.1% |
| 13.9279 | 4 | < 0.1% |
| 13.85895 | 3 | < 0.1% |
| 13.79 | 11 | < 0.1% |
| 13.72105 | 11 | < 0.1% |
| 13.6521 | 26 | < 0.1% |
| 13.58315 | 46 | < 0.1% |
| 13.5142 | 74 | < 0.1% |
| 13.44525 | 106 | < 0.1% |
LongitudAcc
Real number (ℝ)
| Distinct | 139 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.01471261173 |
| Minimum | -10 |
| Maximum | 13 |
| Zeros | 5253570 |
| Zeros (%) | 23.4% |
| Negative | 9283477 |
| Negative (%) | 41.3% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | -10 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -0.2 |
| median | 0 |
| Q3 | 0.2 |
| 95-th percentile | 0.8 |
| Maximum | 13 |
| Range | 23 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.9786698016 |
|---|---|
| Coefficient of variation (CV) | 66.51910754 |
| Kurtosis | 125.3726755 |
| Mean | 0.01471261173 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 9.528671171 |
| Sum | 330797.7 |
| Variance | 0.9577945806 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5253570 | 23.4% |
| -0.1 | 2196076 | 9.8% |
| -0.2 | 1876597 | 8.3% |
| 0.1 | 1869076 | 8.3% |
| 0.2 | 1405793 | 6.3% |
| -0.3 | 1348372 | 6.0% |
| 0.3 | 1054275 | 4.7% |
| -0.4 | 927723 | 4.1% |
| 0.4 | 764429 | 3.4% |
| 0.5 | 617489 | 2.7% |
| Other values (129) | 5170555 | 23.0% |

| Value | Count | Frequency (%) |
|---|---|---|
| -10 | 1 | < 0.1% |
| -9 | 1 | < 0.1% |
| -8.6 | 1 | < 0.1% |
| -8.3 | 1 | < 0.1% |
| -7.9 | 1 | < 0.1% |
| -7.7 | 1 | < 0.1% |
| -7.3 | 3 | < 0.1% |
| -7.2 | 1 | < 0.1% |
| -7.1 | 1 | < 0.1% |
| -7 | 4 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 13 | 82274 | 0.4% |
| 12.9 | 10823 | < 0.1% |
| 5.9 | 1 | < 0.1% |
| 5.8 | 1 | < 0.1% |
| 5.5 | 1 | < 0.1% |
| 5.4 | 2 | < 0.1% |
| 5.3 | 4 | < 0.1% |
| 5.2 | 10 | < 0.1% |
| 5.1 | 11 | < 0.1% |
| 5 | 15 | < 0.1% |
EngineSpeed
Real number (ℝ≥0)
| Distinct | 13871 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1072.689465 |
| Minimum | 0 |
| Maximum | 8191.875 |
| Zeros | 401981 |
| Zeros (%) | 1.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 593.625 |
| Q1 | 892.625 |
| median | 1159 |
| Q3 | 1288 |
| 95-th percentile | 1462.875 |
| Maximum | 8191.875 |
| Range | 8191.875 |
| Interquartile range (IQR) | 395.375 |
Descriptive statistics
| Standard deviation | 322.250278 |
|---|---|
| Coefficient of variation (CV) | 0.3004133895 |
| Kurtosis | 4.392435126 |
| Mean | 1072.689465 |
| Median Absolute Deviation (MAD) | 158.5 |
| Skewness | -0.7061916685 |
| Sum | 2.411830165 × 10¹⁰ |
| Variance | 103845.2417 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 401981 | 1.8% |
| 600 | 68722 | 0.3% |
| 600.25 | 68589 | 0.3% |
| 599.75 | 67429 | 0.3% |
| 600.5 | 65542 | 0.3% |
| 599.5 | 65346 | 0.3% |
| 599.25 | 61294 | 0.3% |
| 600.875 | 59397 | 0.3% |
| 599 | 56840 | 0.3% |
| 600.125 | 56135 | 0.2% |
| Other values (13861) | 21512680 | 95.7% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 401981 | 1.8% |
| 15.25 | 1 | < 0.1% |
| 15.875 | 1 | < 0.1% |
| 17.25 | 1 | < 0.1% |
| 17.875 | 1 | < 0.1% |
| 18.25 | 2 | < 0.1% |
| 18.875 | 1 | < 0.1% |
| 19.625 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 8191.875 | 350 | < 0.1% |
| 2230.625 | 1 | < 0.1% |
| 2227.625 | 1 | < 0.1% |
| 2226.125 | 1 | < 0.1% |
| 2210.375 | 1 | < 0.1% |
| 2195.375 | 1 | < 0.1% |
| 2183.375 | 1 | < 0.1% |
| 2169.25 | 1 | < 0.1% |
| 2158.875 | 2 | < 0.1% |
| 2157 | 1 | < 0.1% |
Fuel Rate
Real number (ℝ≥0)
| Distinct | 1103 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.28743578 |
| Minimum | 0 |
| Maximum | 3876.198645 |
| Zeros | 5140033 |
| Zeros (%) | 22.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.537822 |
| median | 7.984845 |
| Q3 | 21.706949 |
| 95-th percentile | 48.263952 |
| Maximum | 3876.198645 |
| Range | 3876.198645 |
| Interquartile range (IQR) | 20.169127 |
Descriptive statistics
| Standard deviation | 72.64006638 |
|---|---|
| Coefficient of variation (CV) | 4.751618744 |
| Kurtosis | 2693.574083 |
| Mean | 15.28743578 |
| Median Absolute Deviation (MAD) | 7.984845 |
| Skewness | 50.74352758 |
| Sum | 343722018.1 |
| Variance | 5276.579243 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5140033 | 22.9% |
| 3.844555 | 166583 | 0.7% |
| 3.903702 | 159101 | 0.7% |
| 3.785408 | 153193 | 0.7% |
| 3.962849 | 141634 | 0.6% |
| 3.253085 | 131346 | 0.6% |
| 3.726261 | 126208 | 0.6% |
| 3.193938 | 125588 | 0.6% |
| 3.312232 | 123136 | 0.5% |
| 4.021996 | 119381 | 0.5% |
| Other values (1093) | 16097752 | 71.6% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5140033 | 22.9% |
| 0.059147 | 21547 | 0.1% |
| 0.118294 | 20927 | 0.1% |
| 0.177441 | 25702 | 0.1% |
| 0.236588 | 32903 | 0.1% |
| 0.295735 | 30165 | 0.1% |
| 0.354882 | 26707 | 0.1% |
| 0.414029 | 24411 | 0.1% |
| 0.473176 | 21133 | 0.1% |
| 0.532323 | 18387 | 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 3876.198645 | 7596 | < 0.1% |
| 3643.041171 | 1 | < 0.1% |
| 65.0617 | 44 | < 0.1% |
| 65.002553 | 82 | < 0.1% |
| 64.943406 | 154 | < 0.1% |
| 64.884259 | 162 | < 0.1% |
| 64.825112 | 120 | < 0.1% |
| 64.765965 | 145 | < 0.1% |
| 64.706818 | 205 | < 0.1% |
| 64.647671 | 132 | < 0.1% |
Engine Load
Real number (ℝ≥0)
| Distinct | 201 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.86648245 |
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 5163040 |
| Zeros (%) | 23.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4 |
| median | 25 |
| Q3 | 45.5 |
| 95-th percentile | 93 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 41.5 |
Descriptive statistics
| Standard deviation | 28.10432456 |
|---|---|
| Coefficient of variation (CV) | 0.9105127092 |
| Kurtosis | -0.0253281456 |
| Mean | 30.86648245 |
| Median Absolute Deviation (MAD) | 20.5 |
| Skewness | 0.8693444933 |
| Sum | 694000602.5 |
| Variance | 789.853059 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5163040 | 23.0% |
| 100 | 738391 | 3.3% |
| 22.5 | 322595 | 1.4% |
| 23 | 301261 | 1.3% |
| 22 | 288452 | 1.3% |
| 23.5 | 265812 | 1.2% |
| 19 | 253565 | 1.1% |
| 21.5 | 242857 | 1.1% |
| 24 | 239906 | 1.1% |
| 18.5 | 239370 | 1.1% |
| Other values (191) | 14428706 | 64.2% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5163040 | 23.0% |
| 0.5 | 91609 | 0.4% |
| 1 | 70251 | 0.3% |
| 1.5 | 55125 | 0.2% |
| 2 | 51482 | 0.2% |
| 2.5 | 47148 | 0.2% |
| 3 | 49573 | 0.2% |
| 3.5 | 47338 | 0.2% |
| 4 | 51994 | 0.2% |
| 4.5 | 48573 | 0.2% |

| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 738391 | 3.3% |
| 99.5 | 23649 | 0.1% |
| 99 | 29307 | 0.1% |
| 98.5 | 36721 | 0.2% |
| 98 | 31035 | 0.1% |
| 97.5 | 29328 | 0.1% |
| 97 | 27089 | 0.1% |
| 96.5 | 26489 | 0.1% |
| 96 | 26683 | 0.1% |
| 95.5 | 25350 | 0.1% |
Boost Pressure
Real number (ℝ≥0)
| Distinct | 198 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2409602234 |
| Minimum | 0 |
| Maximum | 1.697746 |
| Zeros | 1474517 |
| Zeros (%) | 6.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.051708 |
| median | 0.12927 |
| Q3 | 0.336102 |
| 95-th percentile | 0.870418 |
| Maximum | 1.697746 |
| Range | 1.697746 |
| Interquartile range (IQR) | 0.284394 |
Descriptive statistics
| Standard deviation | 0.2858978553 |
|---|---|
| Coefficient of variation (CV) | 1.186493983 |
| Kurtosis | 3.745014936 |
| Mean | 0.2409602234 |
| Median Absolute Deviation (MAD) | 0.112034 |
| Skewness | 1.922574278 |
| Sum | 5417738.819 |
| Variance | 0.08173758365 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.008618 | 1797794 | 8.0% |
| 0 | 1474517 | 6.6% |
| 0.094798 | 868531 | 3.9% |
| 0.017236 | 836343 | 3.7% |
| 0.08618 | 829829 | 3.7% |
| 0.103416 | 817354 | 3.6% |
| 0.077562 | 701942 | 3.1% |
| 0.112034 | 698850 | 3.1% |
| 0.120652 | 565666 | 2.5% |
| 0.068944 | 545634 | 2.4% |
| Other values (188) | 13347495 | 59.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1474517 | 6.6% |
| 0.008618 | 1797794 | 8.0% |
| 0.017236 | 836343 | 3.7% |
| 0.025854 | 517874 | 2.3% |
| 0.034472 | 418576 | 1.9% |
| 0.04309 | 377647 | 1.7% |
| 0.051708 | 367797 | 1.6% |
| 0.060326 | 421476 | 1.9% |
| 0.068944 | 545634 | 2.4% |
| 0.077562 | 701942 | 3.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1.697746 | 2 | < 0.1% |
| 1.689128 | 3 | < 0.1% |
| 1.68051 | 12 | < 0.1% |
| 1.671892 | 12 | < 0.1% |
| 1.663274 | 16 | < 0.1% |
| 1.654656 | 18 | < 0.1% |
| 1.646038 | 12 | < 0.1% |
| 1.63742 | 19 | < 0.1% |
| 1.628802 | 21 | < 0.1% |
| 1.620184 | 32 | < 0.1% |
EngineAirInletPressure
Real number (ℝ≥0)
| Distinct | 102 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 125.267869 |
| Minimum | 34 |
| Maximum | 510 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 34 |
|---|---|
| 5-th percentile | 102 |
| Q1 | 106 |
| median | 114 |
| Q3 | 134 |
| 95-th percentile | 188 |
| Maximum | 510 |
| Range | 476 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 28.66608806 |
|---|---|
| Coefficient of variation (CV) | 0.2288383149 |
| Kurtosis | 4.227496827 |
| Mean | 125.267869 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.943822648 |
| Sum | 2816517130 |
| Variance | 821.7446044 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 102 | 2433311 | 10.8% |
| 110 | 1826591 | 8.1% |
| 112 | 1699190 | 7.6% |
| 104 | 1463657 | 6.5% |
| 108 | 1275522 | 5.7% |
| 114 | 1130552 | 5.0% |
| 106 | 956562 | 4.3% |
| 116 | 873347 | 3.9% |
| 100 | 811680 | 3.6% |
| 118 | 660129 | 2.9% |
| Other values (92) | 9353414 | 41.6% |

| Value | Count | Frequency (%) |
|---|---|---|
| 34 | 14 | < 0.1% |
| 50 | 5 | < 0.1% |
| 52 | 10 | < 0.1% |
| 66 | 3 | < 0.1% |
| 68 | 20 | < 0.1% |
| 70 | 4 | < 0.1% |
| 82 | 2 | < 0.1% |
| 84 | 20 | < 0.1% |
| 86 | 13 | < 0.1% |
| 92 | 1 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 510 | 368 | < 0.1% |
| 508 | 11 | < 0.1% |
| 272 | 3 | < 0.1% |
| 270 | 14 | < 0.1% |
| 268 | 40 | < 0.1% |
| 266 | 39 | < 0.1% |
| 264 | 67 | < 0.1% |
| 262 | 185 | < 0.1% |
| 260 | 535 | < 0.1% |
| 258 | 1276 | < 0.1% |
AcceleratorPedalPos
Real number (ℝ≥0)
| Distinct | 251 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.84106355 |
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 8840773 |
| Zeros (%) | 39.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 40.8 |
| Q3 | 67.2 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 67.2 |
Descriptive statistics
| Standard deviation | 35.4232917 |
|---|---|
| Coefficient of variation (CV) | 0.9361071908 |
| Kurtosis | -1.41225725 |
| Mean | 37.84106355 |
| Median Absolute Deviation (MAD) | 40.8 |
| Skewness | 0.2272119187 |
| Sum | 850816770 |
| Variance | 1254.809595 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 8840773 | 39.3% |
| 100 | 1209694 | 5.4% |
| 62 | 106055 | 0.5% |
| 59.6 | 104698 | 0.5% |
| 65.2 | 104646 | 0.5% |
| 62.4 | 104562 | 0.5% |
| 63.6 | 104307 | 0.5% |
| 61.2 | 104002 | 0.5% |
| 60.8 | 103702 | 0.5% |
| 64 | 103100 | 0.5% |
| Other values (241) | 11598416 | 51.6% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 8840773 | 39.3% |
| 0.4 | 8847 | < 0.1% |
| 0.8 | 9062 | < 0.1% |
| 1.2 | 9173 | < 0.1% |
| 1.6 | 9125 | < 0.1% |
| 2 | 9324 | < 0.1% |
| 2.4 | 9322 | < 0.1% |
| 2.8 | 9957 | < 0.1% |
| 3.2 | 9626 | < 0.1% |
| 3.6 | 10014 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 1209694 | 5.4% |
| 99.6 | 31026 | 0.1% |
| 99.2 | 30090 | 0.1% |
| 98.8 | 30592 | 0.1% |
| 98.4 | 31648 | 0.1% |
| 98 | 31407 | 0.1% |
| 97.6 | 31924 | 0.1% |
| 97.2 | 32373 | 0.1% |
| 96.8 | 32153 | 0.1% |
| 96.4 | 33273 | 0.1% |
VehicleSpeed
Real number (ℝ≥0)
| Distinct | 1080 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.22274917 |
| Minimum | 0 |
| Maximum | 255.97971 |
| Zeros | 3187013 |
| Zeros (%) | 14.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 16.397388 |
| median | 39.095154 |
| Q3 | 56.594034 |
| 95-th percentile | 75.592818 |
| Maximum | 255.97971 |
| Range | 255.97971 |
| Interquartile range (IQR) | 40.196646 |
Descriptive statistics
| Standard deviation | 24.93746284 |
|---|---|
| Coefficient of variation (CV) | 0.669952204 |
| Kurtosis | 0.5183566949 |
| Mean | 37.22274917 |
| Median Absolute Deviation (MAD) | 20.09637 |
| Skewness | 0.1709047565 |
| Sum | 836914617.2 |
| Variance | 621.877053 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3187013 | 14.2% |
| 48.496896 | 38861 | 0.2% |
| 47.793816 | 37822 | 0.2% |
| 47.395404 | 37681 | 0.2% |
| 47.59461 | 37391 | 0.2% |
| 46.996992 | 37328 | 0.2% |
| 48.094578 | 37304 | 0.2% |
| 47.196198 | 37274 | 0.2% |
| 49.9968 | 37115 | 0.2% |
| 50.09445 | 36983 | 0.2% |
| Other values (1070) | 18959183 | 84.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3187013 | 14.2% |
| 0.999936 | 5072 | < 0.1% |
| 1.097586 | 5987 | < 0.1% |
| 1.199142 | 7236 | < 0.1% |
| 1.296792 | 7799 | < 0.1% |
| 1.398348 | 8275 | < 0.1% |
| 1.499904 | 9102 | < 0.1% |
| 1.597554 | 12402 | 0.1% |
| 1.69911 | 9696 | < 0.1% |
| 1.79676 | 10142 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 255.97971 | 346 | < 0.1% |
| 255.975804 | 6019 | < 0.1% |
| 112.289688 | 1 | < 0.1% |
| 112.192038 | 1 | < 0.1% |
| 112.090482 | 1 | < 0.1% |
| 111.590514 | 1 | < 0.1% |
| 111.391308 | 2 | < 0.1% |
| 111.289752 | 1 | < 0.1% |
| 111.192102 | 2 | < 0.1% |
| 111.090546 | 2 | < 0.1% |
BrakePedalPos
Real number (ℝ≥0)
| Distinct | 244 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.437663454 |
| Minimum | 0 |
| Maximum | 97.2 |
| Zeros | 18270026 |
| Zeros (%) | 81.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 171.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 22.4 |
| Maximum | 97.2 |
| Range | 97.2 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.869066842 |
|---|---|
| Coefficient of variation (CV) | 2.289074235 |
| Kurtosis | 4.764316588 |
| Mean | 3.437663454 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.250963228 |
| Sum | 77292270.4 |
| Variance | 61.92221296 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 18270026 | 81.3% |
| 15.6 | 182889 | 0.8% |
| 16 | 173874 | 0.8% |
| 16.4 | 149439 | 0.7% |
| 16.8 | 148079 | 0.7% |
| 17.2 | 135796 | 0.6% |
| 15.2 | 132112 | 0.6% |
| 17.6 | 125249 | 0.6% |
| 18 | 107043 | 0.5% |
| 20.8 | 93141 | 0.4% |
| Other values (234) | 2966307 | 13.2% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 18270026 | 81.3% |
| 0.4 | 57112 | 0.3% |
| 0.8 | 29574 | 0.1% |
| 1.2 | 18586 | 0.1% |
| 1.6 | 16743 | 0.1% |
| 2 | 15114 | 0.1% |
| 2.4 | 16156 | 0.1% |
| 2.8 | 15778 | 0.1% |
| 3.2 | 14263 | 0.1% |
| 3.6 | 15520 | 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 97.2 | 82 | < 0.1% |
| 96.8 | 374 | < 0.1% |
| 96.4 | 20 | < 0.1% |
| 96 | 6 | < 0.1% |
| 95.6 | 7 | < 0.1% |
| 95.2 | 20 | < 0.1% |
| 94.8 | 28 | < 0.1% |
| 94.4 | 28 | < 0.1% |
| 94 | 9 | < 0.1% |
| 93.6 | 6 | < 0.1% |
Auto
The auto setting is an easily interpretable pairwise column metric that picks the most suitable method for each pair of columns, using the following mapping (vartype-vartype : method): categorical-categorical : Cramer's V; numerical-categorical : Cramer's V (using a discretized numerical column); numerical-numerical : Spearman's ρ.
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
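The definition above can be sketched in plain Python: rank both variables (ties get the average of their positions), then divide the covariance of the ranks by the product of their standard deviations. This is an illustrative sketch with made-up data, not the code the profiler runs; the helper names are assumptions.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def spearman_rho(x, y):
    """Spearman's rho is Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Made-up data: y is a nonlinear but strictly increasing function of x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

Because the relationship is monotonic but not linear, ρ comes out at (numerically) 1 while r stays below 1, which is the difference the paragraph describes.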
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
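The invariance under changes of location and scale can be checked numerically. The sketch below uses made-up values (not rows from the profiled dataset) and positive scale factors; with a negative factor on exactly one variable, the sign of r would flip.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # The 1/n factors of the covariance and standard deviations cancel out.
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


# Made-up illustrative values.
x = [0.0, 16.4, 39.1, 56.6, 75.6]
y = [598.0, 893.0, 1159.0, 1288.0, 1463.0]

r_original = pearson_r(x, y)

# Shift and rescale each variable separately (positive scale factors).
x_rescaled = [2.5 * v + 10.0 for v in x]
y_rescaled = [0.1 * v - 3.0 for v in y]
r_rescaled = pearson_r(x_rescaled, y_rescaled)

print(r_original, r_rescaled)
```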
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
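The pair-counting formula described above corresponds to the variant usually called tau-a. A minimal sketch with made-up data follows; it is quadratic in the number of observations, and library implementations commonly default to tie-adjusted variants such as tau-b, so results can differ when ties are present. Which variant the report itself uses is not stated here.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, as described above."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # Pairs tied in x or y count toward neither group.
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Made-up data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)           # 5 concordant, 1 discordant
tau_reversed = kendall_tau_a(x, [4, 3, 2, 1])
print(tau, tau_reversed)
```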
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation for φk is available.
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
| | Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.985262e+10 | 6.27445 | 0.0 | 598.125 | 6.683611 | 38.5 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 1 | 1.985262e+10 | 6.27445 | 0.0 | 606.250 | 6.801905 | 38.5 | 0.0 | 104.0 | 0.0 | 0.0 | 0.0 |
| 2 | 1.985262e+10 | 6.27445 | 0.0 | 601.875 | 6.742758 | 38.5 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 3 | 1.985262e+10 | 6.27445 | 0.0 | 600.375 | 6.801905 | 39.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 4 | 1.985262e+10 | 6.27445 | 0.0 | 596.375 | 6.565317 | 37.5 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 5 | 1.985263e+10 | 6.27445 | 0.0 | 602.250 | 6.565317 | 37.5 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 6 | 1.985263e+10 | 6.27445 | 0.0 | 601.000 | 6.683611 | 38.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 7 | 1.985263e+10 | 6.34340 | 0.0 | 597.875 | 6.742758 | 38.5 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 8 | 1.985263e+10 | 6.34340 | 0.0 | 597.250 | 6.624464 | 38.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 9 | 1.985263e+10 | 6.34340 | 0.0 | 604.500 | 6.801905 | 39.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
Last rows
| | Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 22483945 | 1.117846e+11 | 10.27355 | -0.1 | 615.625 | 4.021996 | 22.0 | 0.017236 | 104.0 | 0.0 | 2.699046 | 1.6 |
| 22483946 | 1.117846e+11 | 10.34250 | -0.3 | 600.125 | 3.903702 | 22.5 | 0.017236 | 104.0 | 0.0 | 2.097522 | 0.4 |
| 22483947 | 1.117846e+11 | 10.41145 | -0.3 | 596.500 | 4.436025 | 26.0 | 0.017236 | 104.0 | 0.0 | 1.398348 | 0.0 |
| 22483948 | 1.117846e+11 | 10.41145 | 0.0 | 603.750 | 4.258584 | 24.5 | 0.017236 | 104.0 | 14.4 | 0.000000 | 0.0 |
| 22483949 | 1.117846e+11 | 10.20460 | 0.0 | 603.125 | 5.441524 | 33.0 | 0.008618 | 104.0 | 2.4 | 0.000000 | 0.0 |
| 22483950 | 1.117846e+11 | 10.34250 | -0.2 | 619.875 | 6.861052 | 44.5 | 0.017236 | 104.0 | 0.0 | 1.097586 | 0.0 |
| 22483951 | 1.117846e+11 | 10.34250 | 0.0 | 595.375 | 4.672613 | 27.5 | 0.017236 | 104.0 | 0.0 | 0.000000 | 21.2 |
| 22483952 | 1.117846e+11 | 10.34250 | 0.0 | 603.250 | 3.903702 | 22.0 | 0.017236 | 104.0 | 0.0 | 0.000000 | 9.2 |
| 22483953 | 1.117846e+11 | 10.41145 | 0.0 | 586.000 | 4.317731 | 25.0 | 0.017236 | 104.0 | 0.0 | 0.000000 | 0.0 |
| 22483954 | 1.117846e+11 | 10.48040 | 0.0 | 609.625 | 4.081143 | 23.5 | 0.017236 | 104.0 | 0.0 | 0.000000 | 0.0 |